TrackSign-labeled web tracking dataset
Authors
Abstract
Recent studies [8] show that more than 95% of the websites available on the Internet contain at least one of the so-called web tracking systems. These systems are specialized in identifying their users by means of a plethora of different methods. Some of them (e.g., cookies) are very well known by most users. However, the percentage of websites including more "obscure" and privacy-threatening systems, such as fingerprinting methods that identify the user's computer, is constantly increasing. Detecting those systems on today's Internet is very difficult, as almost any website modifies its content dynamically and minimizes its code in order to speed up loading times. This minimization and dynamicity render the website code unreadable by humans. Thus, the research community is constantly looking for new ways to discover unknown web tracking systems running under the hood. In this paper, we present a new web tracking dataset containing information about 76 million URLs and 45 million online resources, extracted from 1.5 million popular websites. The labeling process was done using a state-of-the-art web tracking discovery algorithm called TrackSign [8]. The dataset also contains information about the security relation between the domains, the loaded URLs, and the online resource behind each URL. This information can be useful for different kinds of experiments, such as locating privacy threats, or determining the characteristics of the URL network graph.
Journal
Journal title: Computer Networks
Year: 2023
ISSN: 1872-7069, 1389-1286
DOI: https://doi.org/10.1016/j.comnet.2023.109687